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This work addresses the intrinsic relationship between trees and networks (i.e. graphs). A com- 
plete (invertible) mapping is presented which allows trees to be mapped into weighted graphs and 
then backmapped into the original tree without loss of information. The extension of this method- 
ology to more general networks, including unweighted structures, is also discussed and illustrated. 
It is shown that the identified duality between trees and graphs underlies several key concepts and 
issues of current interest in complex networks, including comprehensive characterization of trees and 
community detection. For instance, additional information about tree structures (e.g. phylogenetic 
trees) can be immediately obtained by taking into account several off-the-shelf network measure- 
ments — such as the clustering coefficient, degree correlations and betweenness centrality. At the 
same time, the hierarchical structure of networks, including the respective communities, becomes 
clear when the network is represented in terms of the respective tree. Indeed, the network-tree 
mapping described in this work provides a simple and yet effective means of community detection. 



'All things must change to something new, to some- 
thing strange.' (H. W. Longfellow) 



I. INTRODUCTION 

Trees and general networks (or graphs) are seemingly 
rather distinct structures, the former characterized by 
a strict hierarchical organization and absence of cycles, 
while the latter are generally tangled and involve several 
cycles. Trees are frequently associated to systems under- 
lain by hierarchies, branching and taxonomies — includ- 
ing but not being limited to phylogenetics, vasculariza- 
tion, hydrography, and even neuronal shapes. Because of 
their deceptive simplicity (especially absence of cycles), 
trees are rarely investigated by considering topological 
measurements other than the number of levels, branches 
and symmetry (e.g P, On the other hand, networks 
or graphs are seemingly more general discrete structures, 
underlying a large number of natural complex systems 
ranging from the Internet to the brain (e.g. [HQ). A 
wealthy of measurements has been proposed and applied 
to characterize the topology of complex networks [5|, 
which can be found in two types: weighted and un- 
weighted, with the latter being a specific instance of the 
former Q. 

Though trees and weighted networks have been treated 
largely in independent fashion, there is an intrinsic and 
important relationship (duality) between these two seem- 
ingly different types of discrete structures that can lead 
to a number of interesting implications of theoretical 
and practical significance. To a limited extent, the du- 
ality tree-network has been sporadic and indirectly ex- 
plored in some works, especially those aimed at mapping 
the hierarchical/modular structure of complex networks 
(e.g. @,0|)- The understanding of trees as networks, how- 
ever, has received even less attention from the complex 
network community. Yet, provided a tree can be mean- 
ingfully mapped into a network, a series of comprehen- 



sive measurements can be immediately obtained which 
can potentially lead to a better understanding and mod- 
eling of the original trees, as illustrated in Figured] [f|. 
Given a tree and its respective limited set of measure- 
ments ft, it is mapped into a respective weighted graph 
which can then be characterized by a richer set of fea- 
tures \iT- The joint consideration of these two sets of 
measurements, as well as their difference A/i, provide a 
very comprehensive resource for analysing and classify- 
ing the original tree. Figure Q] [Bj also illustrates that 
the mapping from a discrete structure into a respective 
set of measurements can be invertible (representation) or 
not invertible (characterization). 

One of the main objectives of the present work is to 
describe a perfect duality between trees and respective 
weighted networks. It is also shown that such results can 
be extended to more general networks, i.e. those which 
can not be obtained by transformations of trees as well as 
unweighted structures. The implications of such a duality 
are many from both theoretic and applied points-of-view. 

This work starts by describing how trees can be 
mapped into respective weighted networks and then 
backmapped without any loss of information. A simple 
algorithm is presented and illustrated for that finality. 
We subsequently address the situation involving more 
general unweighted networks. 

II. BASIC CONCEPTS AND METHODS 

The transformation of a tree into a network, henceforth 
called tree-net mapping, is relatively simple and can be 
performed as follows. First, as illustrated in Figure [2j 
the nodes (leaves) of the same branch (from the bottom 
of the tree upwards) are connected with weights equal to 
one. Then, the nodes of the groups at the next level of 
the tree are connected with the vertices of other groups 
at the same level through edges of weight 2. Every ver- 
tex in each branch is connected to every vertex of the 
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FIG. 1: The transformation of the network into a tree can provide additional information for its characterization. 
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FIG. 2: Example of the adopted methodology to transform a 
tree into a network. 



other groups (a clique). This process is repeated until 
we reach the top of the tree. The last connections will 
have the maximum weight 2H, equal to twice the height 
of the three. Therefore, the resulting weighted networks 
will have their nodes interconnected in order to reflect 
the branching pattern of the original tree. The weight- 
ing scheme adopted above is justified in order to reflect 
the distances between nodes along the tree connectivity. 
Weighted networks that can be derived from trees by the 
above methodology are henceforth called tree-ancestered 
networks and abbreviated as TAN. 

The second type of transformation considered in this 
work, the so-called network-tree mapping, considers an 
agglomerative-thresholding method in order to backmap 



a TAN into the original tree. This can be achieved by 
using the following simple algorithm (please refer to Fig- 
ure [3|): 

1. The weakest edges are determined; 

2. Each respective group of vertices which belong 
to a connected component considering that edges 
weight are subsumed in the weighted matrix, with 
the new columns corresponding to the average be- 
tween the respective joined columns. 

These steps are repeated until the original tree is ob- 
tained. Figure [3] illustrates an application example of 
such methodology. This backmapping can be shown to 
be lossless, in the sense that given a TAN, its original 
respective tree can be perfectly recovered. 

While the previous pair of algorithms (i.e. tree-net and 
net-tree), allowing the transformation of any tree into a 
respective TAN and then backmapping it into the origi- 
nal tree, define a perfect duality between those two types 
of discrete structures, it is also interesting to consider the 
more general case of transforming any network, includ- 
ing unweighted graphs, into trees. This transformation is 
intrinsically related to community detection in complex 
networks (e.g. [1, Q). In the current work, instead of 
trying to obtain a respective tree corresponding to a un- 
weighted network, we first transform it into a weighted 
network by conside ring the shortest distances between 
each pair of nodes [lOL fTTL Il2l ]. The resulting network 
is fully connected and the edge weights between every 
pair of nodes i and j is equal to their respective shortest 
path distance. We considered the following algorithm to 
transform a generic unweighted network into the respec- 
tive tree: 

1. Compute the distance matrix of the network (l3| . 

2. Compute the Euclidean distance between every 
pair of vertices considering their respective columns 
in the distance matrix. 
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FIG. 3: Example of the methodology to transform a network 
into a tree. Note that the columns and rows of the distance 
matrix are joined at each step, while the new elements are 
equal to the average of the grouped columns. 



3. Identify the edges with the shortest Euclidean dis- 
tance (i.e. the smallest entries in the current weight 
matrix) . 

4. Join the edges which belong to respective connected 
components. 
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FIG. 4: A generic, unweighted network can be transformed 
into a weighted network with respect to the distances between 
all nodes and then into a tree by using the hierarchical ag- 
glomerative method proposed in the current work. 



5. Merge the respective vertices and put in the new 
column and row respective weight averages. 

Figure 0] illustrates the described methodology for a 
simple unweighted network. It is clear from this exam- 
ple that the two communities in the original network 
were clearly distinguished as the two main branches in 
the respectively obtained tree. It is important to ob- 
serve that the transformation of this tree into a net- 
work by using the first algorithm presented in this work 
will yield a network with fully connected communities 
(i.e. cliques) considering subsequence weights (i.e. dis- 
tances). Though the generic network-tree-network se- 
quence of transformations is typically not invertible, the 
therefore obtained weighted network represents an inter- 
esting structure which emphasizes the communities hid- 
den in the original unweighted network. By compar- 
ing the original, unweighted network, and the network 
obtained by the network-tree-network transformation, it 
becomes possible to identify the missing links among the 
nodes in each detected community. 
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III. CONCLUSION 

Trees and networks (or graphs) correspond to two of 
the most important and frequently discrete structures in 
physics, biology and computer science. Though tradi- 
tionally treated as quite different structures, they are ac- 
tually closely inter-related as shown in the present work. 
First, trees can be transformed into weighted networks 
(TANs) which can then be backmapped into the original 
tree by using simple respective algorithms. Such a perfect 
duality is important at least because of the two reasons: 
(i) by transforming trees into networks, it becomes pos- 
sible to characterize their topological features in terms 
of a wealthy of off-the-shelf complex networks measure- 
ments and (ii) the transformation of networks into 
trees is intrinsically related to community finding and 
the hierarchical organization of networks. 

We have also shown that even generic, unweighted net- 
works can be mapped into trees. This can be achieved by 
first transforming the original unweighted network into a 
weighted network, whose weights correspond to the dis- 
tances between the nodes in the original network. The 
respective tree is then obtained by progressive merging 



of nodes according to a simple agglomerative algorithm. 
The comparison between this network and the original, 
unweighted network has potential for highlighting the 
missing links amongst the nodes in each community. 

While the current work has been limited to motivat- 
ing and presenting the three algorithms for transforming 
between trees and networks, as well as the perfect dual- 
ity between trees and respectively transformed weighted 
networks, future developments can be performed in or- 
der to apply these results to real-word and theoretical 
complex networks. It would also be interesting to com- 
pare the performance of the generic unweighted network 
to tree transformation as a potentially effective and com- 
putationally parsimonious method. 
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